
Refactor LibriSpeech Conformer RNN-T recipe #2366

Closed

Conversation

@hwangjeff (Contributor) commented May 6, 2022

Modifies the example LibriSpeech Conformer RNN-T recipe as follows:

  • Moves data loading and transforms logic from lightning module to data module (improves generalizability and reusability of lightning module and data module).
  • Moves transforms logic from dataloader collator function to dataset (resolves dataloader multiprocessing issues on certain platforms).
  • Replaces lambda functions with partial equivalents (resolves pickling issues in certain runtime environments; see the sketch after this list).
  • Modifies the training script to allow specifying a model checkpoint path from which to restart training.
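
As a rough illustration of the lambda-to-partial change (the names below are made up for the sketch, not the recipe's actual code):

import functools

import torch

def _apply_transform(transform, waveform):
    # Hypothetical per-sample helper; applies the given transform to a waveform.
    return transform(waveform)

transform = torch.nn.Identity()  # stand-in for the recipe's feature transform

# A lambda such as `lambda w: _apply_transform(transform, w)` cannot be pickled by
# DataLoader workers on platforms that use the "spawn" start method;
# functools.partial builds an equivalent callable that pickles cleanly.
apply_fn = functools.partial(_apply_transform, transform)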

trainer.fit(model)
model = ConformerRNNTModule(str(args.sp_model_path))
data_module = get_data_module(str(args.librispeech_path), str(args.global_stats_path), str(args.sp_model_path))
trainer.fit(model, data_module)
@nateanl (Member) commented May 6, 2022

It'll be great if we add a --resume-checkpoint argument with None as the default value, and change this line to

trainer.fit(model, data_module, ckpt_path=args.resume_checkpoint)
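
A minimal sketch of the suggested flag, assuming the training script builds its CLI with argparse (variable names are illustrative):

import argparse
import pathlib

parser = argparse.ArgumentParser()
parser.add_argument(
    "--resume-checkpoint",
    default=None,
    type=pathlib.Path,
    help="Path to a checkpoint to resume training from. (Default: None)",
)
args = parser.parse_args()

# With the default of None, training starts from scratch; passing a path resumes
# from that checkpoint:
# trainer.fit(model, data_module, ckpt_path=args.resume_checkpoint)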

@xiaohui-zhang (Contributor) commented May 11, 2022

--resume-from-checkpoint may sound better

@hwangjeff force-pushed the librispeech_conformer_rnnt_refactor branch from 0ec7ce4 to 0078c68 on May 6, 2022 20:34
@hwangjeff marked this pull request as ready for review May 11, 2022 01:49
@mthrok (Collaborator) left a comment:

Stamp

from pytorch_lightning import LightningDataModule, seed_everything


seed_everything(1)
Collaborator comment:

Is this the right place to seed? I would imagine this happens once (and only once) at the very beginning of the CLI entry point.
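
A sketch of seeding once at the CLI entry point instead, as suggested (the cli_main name is an assumption about how the script is organized):

from pytorch_lightning import seed_everything

def cli_main():
    # Seed exactly once, before the model, data module, and trainer are built,
    # so every downstream source of randomness is reproducible.
    seed_everything(1)
    # ... parse args, construct model and data module, call trainer.fit(...) ...

if __name__ == "__main__":
    cli_main()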

seed_everything(1)


def _batch_by_token_count(idx_target_lengths, token_limit, sample_limit=None):
Contributor comment:

Can you change sample_limit to batch_size and token_limit to max_tokens per our previous discussion? It's ok if you want to do that in another PR.
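
A sketch of what the renamed signature could look like; the body below illustrates batching by token count and is not the recipe's actual implementation:

def _batch_by_token_count(idx_target_lengths, max_tokens, batch_size=None):
    # Group sample indices into batches whose summed target lengths stay under
    # max_tokens, optionally capping the number of samples per batch at batch_size.
    batches, current_batch, current_tokens = [], [], 0
    for idx, target_length in idx_target_lengths:
        over_tokens = current_tokens + target_length > max_tokens
        over_samples = batch_size is not None and len(current_batch) == batch_size
        if current_batch and (over_tokens or over_samples):
            batches.append(current_batch)
            current_batch, current_tokens = [], 0
        current_batch.append(idx)
        current_tokens += target_length
    if current_batch:
        batches.append(current_batch)
    return batches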

        return dataloader

    def test_dataloader(self):
        dataset = torchaudio.datasets.LIBRISPEECH(self.librispeech_path, url="test-clean")
Contributor comment:

Better make the eval split customizable, as we discussed. Again, I can do that in a later PR as well.
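
One way to make the eval split configurable, sketched against a simplified data module (the test_url parameter is an assumption for illustration, not the recipe's actual API):

import torch
import torchaudio

class LibriSpeechDataModule:
    # Simplified stand-in for the recipe's data module; only the pieces relevant
    # to the suggestion are shown.
    def __init__(self, librispeech_path, test_url="test-clean"):
        self.librispeech_path = librispeech_path
        self.test_url = test_url  # e.g. "test-clean" or "test-other"

    def test_dataloader(self):
        dataset = torchaudio.datasets.LIBRISPEECH(self.librispeech_path, url=self.test_url)
        return torch.utils.data.DataLoader(dataset, batch_size=1)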

train_transform,
val_transform,
test_transform,
max_token_limit=700,
Contributor comment:

max_tokens should be fine; "max" duplicates "limit".

val_transform,
test_transform,
max_token_limit=700,
sample_limit=2,
Contributor comment:

batch_size (same renaming here)

@facebook-github-bot (Contributor) commented:
@hwangjeff has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@github-actions commented:
Hey @hwangjeff.
You merged this PR, but labels were not properly added. Please add a primary and secondary label (See https://github.com/pytorch/audio/blob/main/.github/process_commit.py)
